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5 [BRIEF DESCRIPTION OF THE DRAWINGS] 

Fig. 1 shows the MALDI-TOF mass spectrum of the 7mer when the 2741 st base 
of the 4 th intron of human maspin (serpinbS) gene is normal (C/C). 

Fig. 2 shows the MALDI-TOF mass spectrum of the 13mer when the 2741 st base 
of the 4 th intron of human maspin gene is normal (C/C). 
10 Fig. 3 shows the MALDI-TOF mass spectrum of the 7mer when the 2741 st base 

of the 4 th intron of human maspin gene is hetero (C/T). 

Fig. 4 shows the MALDI-TOF mass spectrum of the 13mer when the 2741 st base 
of the 4 th intron of human maspin gene is hetero (C/T). 

Fig. 5 shows the MALDI-TOF mass spectrum of the 7mer when the 2741 st base 
1 5 of the 4 th intron of human maspin gene is all changed into T (T/T). 

Fig. 6 shows the MALDI-TOF mass spectrum of the 13mer when the 2741 st base 
of the 4 th intron of human maspin gene is all changed into T (T/T). 

Fig. 7 shows the MALDI-TOF mass spectrum of the 7mer and the 13mer when 
the 3597 th base of the 4 th intron of human maspin gene is normal (C/C). 
20 Fig. 8 shows the MALDI-TOF mass spectrum of the 7mer and the 13mer when 

the 3597 th base of the 4 th intron of human maspin gene is hetero (C/T). 

Fig. 9 shows the MALDI-TOF mass spectrum of the 7mer and the 13mer when 
the 3597 th base of the 4 th intron of human maspin gene is all changed into T (T/T). 



[DETAILED DESCRIPTION OF THE INVENTION] 
[OBJECT OF THE INVENTION] 

[TECHNICAL FIELD OF THE INVENTION AND BACKGROUND ART OF THE 
5 FIELD] 

Genetic analysis of an organism is used for disease risk, diagnosis, prognosis or 
disease treatment. For example, mutation analysis on a specific gene of a specific 
person makes it possible to predict the disease risk, thereby inducing prevention of the 
disease. Whether other organisms, such as a virus, etc., that causes a disease has 
10 resistance to medicines is examined through mutation analysis, which results in 
effective treatment. The present invention relates to a method for analyzing genes of 
an organism, and more specifically to a method for determining genetic variation in an 
organism. 

Human genome project enables more broad measurement of disease risk, 
15 diagnosis or prognosis and prediction of reaction on medication. Nucleotide sequence 
analysis of a plurality of individuals presents polymorphic sites, which are referred to as 
SNPs (single nucleotide polymorphisms). The SNP is a variation occurred over the 
specific frequency in a nucleotide sequence of chromosome in organism. In human 
body, SNPs occur every about 1,000 bases. In consideration of the size of human 
20 genome, millions of SNPs exist in human body. Since the SNP is regarded as a means 
for explaining characteristic difference between individuals, the SNP can be used in 
prevention or treatment of disease by examination of cause of disease. 

SNPs discovered by the human genome project show only that the 



polymorphism exist in human body but do not show how those polymorphisms are 
related to disease. In order to reveal the relationship between the SNPs and diseases, a 
comparative analysis of polymorphism pattern represented in healthy people and 
patients, SNP scoring, is required. For precise examination of the relationship between 
the SNP and disease, a large number of SNPs should be analyzed without error. 

The SNP scoring method includes DNA sequencing, PCR-SSCP (Polymerase 
chain reaction - Single stranded conformation polymorphism), allele specific 
hybridization, oligo-ligation, mini-sequencing and enzyme cleavage method. A 
method using a DNA chip is also introduced, but it is not different from the allele 
specific hybridization in principle except that it uses a support to which oligonucleotide 
probe adheres. 

The two classical methods for carrying out DNA sequencing are the Maxam and 
Gilbert chemical procedure and the Sanger method that has been mainly used in recent 
years. The DNA sequencing method is to find out nucleotide sequences of the whole 
or a part of genes rather than to examine genetic variations of specific sites. Since 
genetic variations of specific sites may be identified by examination of nucleotide 
sequences, the DNA sequencing method can be used for the SNP scoring. However, 
the DNA sequencing method is ineffective because adjacent nucleotide sequences that 
do not require examination are read with target SNP to be examined. 

In PCR-SSCP (Orita, M. et al., Genomics, 1989, 5:8874-8879), sequences 
including SNPs to be analyzed are amplified by PCR, and then separated into each 
strand. Thereafter, electrophoresis is performed on polyacrylamide gel. Since the 
secondary structure of DNA strand is changed by difference of one base in sequence, the 



one base variation in sequence may be examined from differences in electrophoresis 
running velocity resulting from the difference of structure. 

The allele specific hybridization is to examine variations by hybridizing DNAs 
labeled with radioisotope to probes attached to a nylon filer, by regulating hybridization 
5 conditions such as temperature. 

The oligo-ligation (Nucleic Acid Research 24, 3728, 1996) is to examine 
sequence variations by performing a ligation reaction under a condition where the 
ligation does not happen if target DNA is non-complementary with template DNAs, 
followed by confirming whether it happens. 
10 The mini-sequencing (Genome Research 7:606, 1997) is developed for SNP 

scoring. This method performs DNA polymerization in a condition that only one base of 
interest can be polymerized and distinguish what the polymerized base is. 

The PCR-SSCP, the allele specific hybridization, the oligo-ligation are 
ineffective methods in analysis of many samples because of its use of polyacrylamide 
15 gel. And the errors resulting from mismatching of probes with undesired sites cannot be 
identified by those methods. 

Although the mini-sequencing is simple and effective in analysis of many 
samples since it was developed for the SNP scoring, the incorrect result by errors of the 
mismatching cannot be still identified, and base deletion and insertion cannot be found 
20 by the mini-sequencing. 

The enzyme cleavage method is also developed for SNP scoring (WO 01/90419). 
In the enzyme cleavage method, sequences to be analyzed are amplified by appropriate 
methods like the PCR. The amplified products include sequences that can be cleaved 



or recognized by two restriction enzymes. The enzyme cleavage method is to examine 
sequence variations by cleaving the amplified products with two restriction enzymes and 
measuring the molecular weight of the cleaved fragments. The enzyme cleavage has 
an advantage of simplicity and rapidity because the molecular weight of the fragments 
5 obtained from the restriction enzyme reaction is measured by mass spectrometry right 
after amplification of genes by PCR. However, the incorrect analysis by errors is not 
identified by the enzyme cleavage method described in WO 01/90419. Although the 
incorrect analysis may be induced when primers are combined in undesired sites during 
the PCR, it is not identified. For example, the primer used to examine polymorphisms 

10 of CYP2C9 may be combined with CYP2C8. In this case, it is difficult to discover 
whether the errors are generated because whether the primer is combined with CYP2C8 
other than CYP2C9 cannot be identified. Also, this method can detect one base 
substitution with other bases, but cannot detect base deletion or insertion. Also, there 
is no method capable of detecting substitution of two or more adjacent bases at the same 

1 5 time. 

[TECHNICAL SUBJECT TO BE ACHIEVED BY THE INVENTION] 

The present invention is designed to solve the problems of the prior art, and 
therefore an object of the present invention is to provide a method for precisely and 
20 effectively detecting mutations of organism. 



[CONSTITUTION OF THE INVENTION] 

In order to achieve the above-described object, there is provided a method for 
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precisely and effectively detecting mutations of organism in an embodiment of the 
present invention. 

Speaking simply, the present invention enables simple and rapid examination of 
mutations in many samples and precise examination of mutation by identification of 
errors resulting from binding of primers at incorrect regions. Moreover the present 
invention provides method of examining two or more sites of mutations, which are 
adjacent within 32 bases at the same time, and detecting deletion or insertion. Especially 
when there are various genotypes in an individual, it can be identified whether 
mutations in different sites simultaneously exist in one genotype or exist with their 
being mixed in different genotypes. For example, a human has a pair (two) of 
chromosomes having the same genetic information. When mutations occur they may 
occur either in the two chromosomes (homo) or in one chromosome (hetero). When 
two or more mutations of adjacent base are all hetero, those mutations may exist in one 
chromosome at the same time or in different chromosomes. Since the two cases may 
have different effect on life, those should be distinguished. In case of virus which 
infects human, various genotypes is mixed. When two or more mutations of adjacent 
bases are all hetero, it should be distinguished whether those mutations exist in one 
genotype at the same time or in different genotypes. 

In order to analyze mutations, the method of present invention amplify desired 
sequence to include sites where the resulting product may be cleaved by restriction 
enzymes and the number of bases in the fragments cleaved by the restriction enzymes is 
designed to be 32 or less and at least one base among them is made to be produced by 
replication of template not primers itself and after the amplified fragments are cleaved 



by restriction enzymes, the molecular weight of the fragments is measured to analyze 
mutations. 

In an embodiment, there is provided a method for detecting mutations, 
comprising: a) amplifying a target polynucleotide using a forward primer and a reverse 
5 primer; b) generating fragments of two or more single-stranded polynucleotides 
including one or more mutation sequences having the size of 2-32 bases by cleaving the 
amplified polynucleotide with restriction enzymes; and c) measuring the molecular 
weight of the cleaved fragments. 

Preferably, the amplified polynucleotide is cleaved to include one mutation 
10 among two or more different mutations in only one single stranded polynucleotide 
fragment and all mutations in the other single stranded nucleotide fragment. For 
example, when the bases A and G in the sequence ...ATG... are mutation sequences, a 
first single-stranded nucleotide fragment generated by restriction enzyme cleavage 
includes only A of the two mutation sequences, and a second single-stranded nucleotide 
1 5 includes both A and G. 

In order to analyze mutations, the method of the present invention amplify 
desired sequence to include sites where the resulting product may be cleaved by 
restriction enzymes and the cleaved fragments have a following structure. 
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The term 'restriction enzyme recognition sequence' is a sequence simultaneously 
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or adjacently recognized by different restriction enzymes, which may not correspond to 
a cleaved sequence. For example, both Fokl and BstFSI recognize the sequence 
GGATG. But, the cleaved sites are next to the 9 th /13 th and 2 nd /0 th bases from the 3' 
end of the recognition sequence, respectively. Both of the two restriction enzymes for 
5 recognizing the restriction enzyme recognition sequence may have the same optimum 
temperature or different optimum temperatures, both restriction enzymes being used in 
the present invention. Among them, the restriction enzymes having different optimum 
temperatures are more preferable. Preferably, the restriction enzymes are a restriction 
enzyme having a relatively low optimum temperature selected from the group consisting 

10 of Fokl, Bbv I, Bsg I, Beg I, Bpm I, BseR I and Bae I, and a restriction enzyme having a 
relatively high optimum temperature selected from the group consisting of BstF5 I, Taq 
I, BsaB I, Btr I, BstAP I, Fau I, Bel I, Pci I and Apo I. Most preferably, the restriction 
enzymes are Fokl and BstF5 I. 

The restriction enzymes having the relatively low optimum temperature are Bae 

15 I(25°C), Fokl, Bbv I, Bsg I, Beg I, Bpm I, BseR I, Mmel I and Ava II(37°C). The 
enzymes having the relatively high optimum temperature are BstF5 I, Taq I (65°C), 
BsaB I, Btr I, BstAP I (60°C), Bel I, Pci I and Apo I (50°C). 

One of the two primers used in PCR amplification comprises a primer binding 
20 sequence 1 , a restriction enzyme recognition sequence and a primer binding sequence 2, 
and the other primer comprises a primer binding sequence 3. 

The 'primer binding sequence' is a sequence that is complementary with nucleic 
acid to be template, but the restriction enzyme recognition sequence may not be 
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complementary with the nucleic acid. The number of bases of the primer binding 
sequences 1 , 2 and 3 should be at least four or more bases to bind with template DNA. 
Since the primer was well combined with template DNA in the size of 8-30 bases, the 
number of bases preferably ranges from 8 to 30. The 'front sequence from mutation' is 
5 a sequence toward 5' of the mutation to be examined. The 'mutation sequence' is a 
sequence corresponding to a mutation to be examined. Substitution, insertion and 
deletion of bases may occur, wherein the number of bases is generally 1 and may be two 
or more. The 'sequence behind mutation' is a sequence toward 3' of the mutation 
sequence. 

10 Preferably, the total number of bases of the front sequence from mutation and 

behind mutation is one or more. The fragments resulting from restriction enzyme 
cleavage should include mutation sequences, and the size of the fragment preferably 
ranges from 2 to 32 bases. More preferably, the size is 12 bases. The reason the size 
of cleaved fragments is limited is that there is a good result in case of the favorable size 

15 of fragments in mass spectrometry analysis. The above number of bases in fragments 
is preferable because the fragment having the size of over 32 bases is too large to 
examine mutations by measuring the molecular weight using mass spectrometry. And 
the fragment only having a base is not preferable because the fragment having only a 
mutation sequence disables identification of binding of primers at incorrect sites. 

20 Since the two restriction enzymes recognize the same or adjacent sites, it is preferable 
that the other restriction enzyme does not activate while one restriction enzyme reacts 
with the amplified product. When the amplified fragments are cleaved with restriction 
enzymes, reaction may be performed consecutively at different temperatures in 
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consideration of the optimum temperatures of two restriction enzymes. Otherwise, the 
fragments may be cleaved with one restriction enzyme, and then with another restriction 
enzyme. Here, the cleavage by the first restriction enzyme should not remove or 
damage recognition sequences or its cleavage sites of the second restriction enzyme 
5 existing in the fragment including the mutation sequence. 

Example 1 . Mutation of the 2741 st base of the 4 th intron of human maspin gene 
Mutation of the 2741 st base (rsl509477; the 61001755th base of chromosome 
No. 18) of the 4 th intron of human maspin (serpinb5) gene that is known as cancer 
10 metastasis inhibition gene is examined. 

1. PCR amplification and restriction enzyme cleavage 
The sequence (5' -> 3') of template DNA is as follows. 

GTT TCACTTGATAAAGCAATAAAATGCTATTCA cAGCT GCATGAGGCT 
ACACCCTTCTTTTGAAT GCAG (SEQ ID NO: 1) 

15 

The underlined sequences are sites where the following primers 1 and 2 are 
hybridized. The bases represented by small letters are 'mutation sequence'. 

Primer 1. 5' - TCACTTGATAAAGCAATAAAAggatgGCTATTCA - 3' 
(34mer) (SEQ ID NO: 2) 
20 Primer 2. 5'- CATTCAAAAGAAGGGTGTAGCCTCATGC - 3' (28mer) (SEQ 

ID NO: 3) 

The sequences represented by small letters are recognition sequences of Fokl 
and BstF5I. 
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PCR buffer (lx), 2mM of MgS0 4 , 200mM of dNTP, Platinum Taq Polymerase 
(Invitrogen, 10966-026) 0.3 15U, 0.5 |aM of primer 1 and 0.5 ^iM of primer 2, and 36ng 
of genomic DNA were added to be 1 8^1 of the total reaction volume. Then, the PCR 
reaction was performed under the following condition. 
5 94°C, 2min. 

94°C, 15sec. 55°C, 15sec. 72°C ? 30sec. (10 cycles), 

94°C, 15sec. 60°C, 15sec. 72°C, 30sec. (35 cycles) 

The genomic DNA was isolated from blood and purified. For example, 
'SDS/protease K' method (Maniatis, Molecular Cloning, A laboratory Manual, Cold 
10 Spring Harbor Laboratory Press, Cold Spring Harbor, 1989) or QIAamp DNA Mini Kit 
250 (Qiagen 51106) could be used in isolation of DNA from blood. When the 
concentration of DNA is low, the DNA can be concentrated by the following method. 
First 1/10 volume of 3 M Sodium acetate (pH 5.3) and 2.5 volume of ethanol were 
added to DNA solution and gently mixed. The resulting solution was left at -20°C for 
15 more than 1 hour, and then centrifuged at 4°C, 13000rpm for 15 minutes. After the 
supernatant was removed, 70% ethanol was added and the resulting solution was 
centrifuged at 4°C, BOOOrpm for 10 minutes. Then, ethanol was dried, and desired 
volume of distilled water was added to the resulting solution. 

The sequence of fragments obtained from the PCR is as follows (5'-> 3'). 
20 TCACTTGATAAAGCAATAAAAggatgGC TATTCArC/Tl AGCTGCATGAG 
GCTACACCCTTCTTTTGAATG (SEQ ID NO: 4) 

AGTGAACTATTTCGTTATTTTcctac CGATAAGTfG/AITCGA CGTACTCC 
GATGTGGGAAGAAAACTTAC (SEQ ID NO: 5) 
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The sites represented by small letters are sequences recognized by Fokl and 
BstF5I, the underlined sites are sequences of fragments generated by restriction enzyme 
cleavage, and the bases represented by brackets are 'mutation sequences'. To the 
5 reactant were added Fokl (NEB R109L) 1U, BstF5I (NEB, V0031L) 1U, 50mM of 
potassium acetate, 20mM of Tris-acetate, lOmM of magnesium acetate, ImM of DTT 
(pH 7.9 @ 25°C). The resulting solution was reacted at 25°C for 2 hours, and 
consecutively at 45°C for 2 hours. 

For optimization of enzyme reaction, the amplified products were reacted with 
10 Fokl and BstFSI at 25°C, 37°C, 45°C, 55°C and 65°C. As a result, 70% of enzyme 
reaction proceeded at 25°C, and more than 90% enzyme reaction proceeded at 37°C in 
case of Fokl. In case of BstF5I, the enzyme reaction didn't proceeded at 25°C. 
Accordingly, the products were preferably reacted first at 25 °C where only Fokl could 
react, and then at over 37°C where BstFSI could react. 

15 

2. Purification and Desalination 

Preferably, DNA fragments were purely isolated from the above solution treated 

with restriction enzymes, and then the molecular weights of the fragments were 

measured. For example, Nucleave Genotyping Kit (Variagenics, USA) might be used. 

20 70^1 of 1M TEAA (Triethylammoniumacetate, pH 7.6) was added to the restriction 

enzyme reaction solution, and left for 1 minute. 70|il of 1M TEAA and 90jal of the 

above mixed solution were added to a Sample Preparation Plate, and then 85jal of 0.1 M 

TEAA was five times passed through the Sample Preparation Plate. The Sample 
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Preparation Plate was centriflxged at lOOOrpm for 5 minutes. Thereafter, the Sample 
Preparation Plate was placed on a Collection Plate, and then 60(il of 60% isopropanol 
was added thereto and passed. When the effluent solution was collected in the 
Collection plate, the Collection Plate was dried at 1 15°C for 75 minutes. 

3. MALDI-TOF Mass Spectrometry 

6|al of MALDI matrix (22.8mg ammonium citrate, 148.5mg hydroxypicolinic 
acid, 1.12ml acetonitrile, 7.8ml H20) was added to the Collection Plate, and then 4|il of 
mixture of the MALDI matrix and effluent solution was placed on an Anchor chip plate 
of MALDI-TOF (Biflex IV, Bruker). It was dried at 37°C for 30 minutes, left at room 
temperature to be cooled for a while, and then subjected to MALDI-TOF analysis. The 
analysis method follows the MALDI-TOF manual. 

When the 2741 st base of the 4 th intron is normal (C/C), the molecular weight of 
fragments obtained after enzyme cleavage is 2135.4 D (7mer) and 4078.6 (13mer) D 
(see Figs. 1 and 2). When the 2741 st base of the 4 th intron is hetero (C/T), the 
molecular weight of fragments is 2135.4 D, 2150.4 D (7mer) and 4078.6 D and 4062.6 
D (13mer) (see Figs. 3 and 4). When the 2741 st base of the 4 th intron is all changed 
into T (T/T), the molecular weight of fragments is 2150.4 D (7mer) and 4062.6 D 
(13mer) (see Figs. 5 and 6). 

Example 2. Mutation of the 3597 th base (rs!396782; 61002611th base of 
chromosome No. 18) of the 4 th intron of human maspin (serpinbS) gene known as 
human cancer metastasis inhibition gene 

The sequence of template DNA is as follows. 
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CTG GAGTATTATCCTTGCAGGCTTGATATGAAG cTTGAAAT TTCTCCC 
CAAAGAGATTTAGTTAACAGGC AAA (SEQ ID NO: 6) 

The underlined sequences are sites where the following primers 3 and 4 
hybridize. The mutation represented by a small letter is a 'mutation sequence'. 
5 Primer 3. 5' GAGTATTATCCTTGCAGGCTTggatgATATGAAG 3' (34mer) 

(SEQ ID NO: 7) 

Primer 4. 5' - GCCTGTTAACTAAATCTCTTTGGGGAGAA 3' (29mer) 
(SEQ ID NO: 8) 

The sites represented by small letters in the above primers are sequences that do 
10 not exist in template DNA, but Fokl and BstF5I recognize them. The experimental 
method including the PCR reaction is the same as that of Example 1 . 

The sequences of fragments obtained through the PCR are as follows (5'->3'). 
G AGT ATT ATCCTTGC AGGCTTggate AT ATG A AG IC/T1 TTG A A ATTTCTC 
CCCAAAGAGATTTAGTTAACAGGC (SEQ ID NO: 9) 
1 5 CTC AT AATAGG AACGTCCGAAcctac TATACTTC IG/A1 AACT TTA AAG AG 

GGGTTTCTCTAAATCAATTGTCCG (SEQ ID NO: 10) 

The sites represented by small letters in the above sequences are restriction 
enzyme recognition sequences, the underlined sites are sequences of fragments obtained 
20 from restriction enzyme cleavage, and the bases represented by brackets ([ ]) are 
'mutation sequences'. To the reactant were added Fokl (NEB R109L) 1U, BstF5I 
(NEB, V0031L) 1U, 50mM of potassium acetate, 20mM of Tris-acetate, lOmM of 
magnesium acetate, ImM of DTT (pH 7.9 @ 25°C). The resulting solution was 
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reacted at 25°C for 2 hours, and consecutively at 45°C for 2 hours. 

When the 3597 th base of the 4 th intron is normal (C/C), the molecular weight of 
fragments obtained from enzyme cleavage is 2209.4 D (7mer) and 3988.6 D (13mer) 
(see Fig. 7). When the 3597 th base of the 4 th intron is hetero (C/T), the molecular 
5 weight of fragments is 2209.4 D 5 2224.4 D (7mer), and 3988.6 D and 3972. 6 D (13mer) 
(see Fig. 8). When the 3597 th base of the 4 th intron is all changed into T (T/T), the 
molecular weight of fragments is 2224.4 D (7mer) and 3972. 6 D (13mer) (see Fig. 9). 

Example 3. Base mutation of tyrosine-methionine-aspartate-aspartate (YMDD) 
10 site of hepatitis B virus DNA polymerase 

Mutations of YMDD site located in DNA polymerase gene of hepatitis B virus 
that causes hepatitis B to human were examined. Resistance to lamivudine that is used 
in a treatment of hepatitis B was generated by the mutation of YMDD site. It was 
known that resistance to lamivudine was generated when methionine (M) that is a codon 
15 No. 552 was changed into valine (V) or isoleucine (I). 

1. PCR amplification and restriction enzyme cleavage 

Hepatitis B virus DNA was isolated from 0.2ml of serum using QIAamp blood 
kit (Qiagen, CA), and 2ul of DNA was used in PCR amplification. 

The sequence of Template DNA (5'-> 3') is as follows. 
20 TTCCCCCACTGTTTGGCTTTCAGTTAT ATG GATGATGTGGTATTGGGG 
GCCAAGTCTGTA (SEQ ID NO: 11) 

The underlined sequences are sites where the following primers 5 and 6 are 
hybridized. 
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Primer 5 (SEQ ID NO: 12). 

5'- TTCCCCCACTGTTTGGCTggatgTCAGTTAT - 3' (31mer) 
Primer 6 (SEQ ID NO: 13). 

5'- TACAGACTTGGCCCCCAATACCACATGATC- 3' (30mer) 
5 The sequence represented by a small letter in the primer 5 is a recognition 

sequence of Fokl and BstF5I, which is not included in template DNA rather artificially 
inserted. The underlined sequence in the primer 6 is an artificially changed sequence 
to prevent recognition by Fokl . 

By using 18ul of reaction solution containing 20mM of Tris HC1 (pH 8.4), 
10 50mM of KC1, 0.2mM of dNTP, 0.4U Platinum Taq Polymerase (Invitrogen, 
10966-026), lOpmol of the primer 5 and lOpmol of the primer 6, the PCR reaction was 
performed under the following condition. 

94°C, 2min. 

94°C,15sec. 50°C, 15sec. 72°C, 30sec. (10 cycles), 
15 94°C, 15sec. 55°C, 15sec. 72°C, 30sec. (35 cycles) 

The sequences of fragments obtained through the PCR are as follows (5'->3'). 

TTCCCCCACTGTTTGGCTggatgTC AGTTATA TGGATCATGTGGTATTGG 
GGGCCAAGTCTGTA (SEQ ID NO: 14) 

AAGGGGGTGACAAACCGAcctac AGTCAATATACCT AGTACACCATAA 
20 CCCCCGGTTCAGACAT (SEQ ID NO: 1 5) 

The sites represented by small letters are sequences recognized by Fokl and 
BstF5I, and the underlined sites are sequences of fragments obtained from restriction 
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enzyme cleavage. The PCR products were mixed with Fokl (NEB R109L) 1U, BstFSI 
(NEB, V0031L) 1U and 10|il of reaction solution (50mM potassium acetate, 20mM 
Tris-acetate, lOmM magnesium acetate, ImM DTT). The mixture solution was 
reacted at 37°C for 2 hours, and then at 45°C for 2 hours. The PCR products might be 
cleaved first by Fokl at 37°C for 2 hours, and then by BstF5I at 45°C for 2 hours. 

2. Purification & Desalination, and MALDI-TOF Mass Spectrometry 

The experiment was performed by the same method as that of Example 1 . 

The calculative size of fragments obtained from enzyme cleavage precisely 
corresponds to the value measured by the actual molecular weight analysis, showing the 
difference of less than 0.1% (see Table 1). 

[Table 1] Presumption and observation mass of oligonucleotide by restriction 



enzyme cleavage of PCR product 







Sequence of presumed fragment 




Mass of 
presumed 
fragment (Da) 




Mass of 
observed 
fragment (Da) 


Genotype 
codon No. 
552 




7mer 13mer 




7mer 13mer 




7mer 1 3mer 


YMDD aTg 
YVDD aTg 
YIDD aTt 
YIDD aTc 
YIDD aTa 




AGTTATa TCcAtATAACTGA 
AGTTATg TCcAcATAACTGA 
AGTTATa TCaAtATAACTGA 
AGTTATa TCgAtATAACTGA 
AGTTATa TCtAtATAACTGA 




2199.4 3997.6 
2215.4 3982.6 
2199.4 4021.6 
2199.4 4037.6 
2199.4 4012.6 




2199.6 3998.0 
2215.9 3982.6 
2199.6 4021.8 
2199.6 4038.0 
2199.6 4012.6 



In the above table, the resolution (difference between the observed mass and the 
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presumed mass divided by the presumed mass) is less than 0.1%. 

Example 4. Mutation of 5' NCR (Non Coding Region) site of hepatitis C virus 
When interferon is used for treatment of chronic hepatitis C, different treatment 
5 effects are shown depending on genotypes of hepatitis C in human body. As a result, 
the examination of genotypes of hepatitis C virus in human body is required before use 
of interferon. In order to find out genotypes, the examination of mutation of 5' NCR is 
useful. In an embodiment of the present invention, a method for analyzing mutations 
of 5 'NCR sites of hepatitis C virus is disclosed. 

10 

1. RT PCR 

RNA of hepatitis C virus was isolated from 0.14ml of serum using QIAamp viral 
RNA Mini kit (Qiagen, CA), and lOjal of the RNA was used in RT PCR amplification. 

Reaction solution containing 0.2mM of dNTP, 0.4uM of primer 2 and 10|al of 
15 RNA was reacted at 65°C for 5 minutes, and left on ice for 1 minute. The reaction 
solution was mixed with 20mM of TrisHCl (pH 8.4), 50mM of KC1, 4mM of DTT, 
0.4uM of primer 1, 100U Superscript III RNase H-Re verse Transcriptase (Invitrogen, 
18080-044), 20U RNaseOUT (Invitrogen, 10777-019), 0.4U Platinum Taq Polymerase 
(Invitrogen, 10966-026). Then, the RT PCR was performed using 25^1 of the resulting 
20 solution under the following condition. 
50°C, 45min., 
94°C, 2sec, 

94°C, 1 5sec. 55°C, 1 5sec. 72°C, 30sec. (35 cycles) 
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72°C, 5min. 

The sequence of template DNA (5'->3') is as follows. 

GCAGAAAGCGTCTAGCCATGGCGT TAGTATGAGT (omitted) 
ACTGCCTGATAGGGTGCTTGCGAG (SEQ ID NO: 16) 
5 The underlined sequences are sites where the following primers 7 and 8 are 

hybridized. 

Primer 7 (SEQ ID NO: 17). 5' - GCAGAAAGCGTCTAGCCATGGCGT - 3 5 
(24mer) 

Primer 8 (SEQ ID NO: 18). 5' - CTCGCAAGCACCCTATCAGGCAGT - V 
10 (24mer) 



2. Nested PCR and Restriction enzyme cleavage 

The above RT PCR reaction solution was diluted into 1/50. 2ul of the diluted 
solution was mixed with 18ul of reaction solution containing 20mM of TrisHCl (pH 

15 8.4), 50mM KC1 5 0.2mM dNTP, 0.4 U Platinum Taq Polymerase (Invitrogen, 
10966-026), lOpmol of primers 9 and 10, lOpmol of primers 11 and 12 and lOpmol of 
primers 13 and 14. The following three types of PCR reaction and restriction enzyme 
treatment were performed using the mixture solution. The primers 9 and 10 were used 
in the reaction 1, the primers 1 1 and 12 in the reaction 2, and the primers 13 and 14 in 

20 the reaction 3. The PCR reaction temperature and time of the three reactions are as 
follows. 

94°C, 5min 

94°C, 30sec. 55°C, 30sec. 72°C, 30sec. (35 cycles) 
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72°C, 5min. 



1) Reaction 1 

The PCR was performed on the RT-PCR solution using the primers 9 and 10. 
5 The sequence of template DNA (5'->3 ') is as follows. 

CGTCTAGCCATGGCGTTAGTATGAGTGT CGTGCAGCCTCCAGGACCr. . 

(omitted) 

CTGCTAGCCGAGTAGTGTTGGGTCGCGAAAG GCCTTGTGGTACTGCCTGATA 
GGG (SEP ID NO: 19) 

10 

The underlined sequences are sites where the following primers 9 and 10 are 
hybridized. 

Primer 9 (SEQ ID NO: 20). 

5' - CGTCTAGCCATGGCGTTAGggatgATGAGTGT - 3' (32mer) 
15 Primer 10 (SEQ ID NO: 21) 

5' - CCCTATCAGGCAGTACCACAAGGC - 3' (24mer) 
The sequences of fragments produced by the PCR are as follows (5'->3'). 
CGTCTAGCCATGGCGTTAGggatgATGAGTGTCGTGCAGCCTCCAGGAC 
CC . . . (omitted) 

20 GCAGATCGGTACCGCAATCcctac TACTCACAGCACG TCGGAGGTCCTG 
GG . . . (omitted) 

. . .CTGCTAGCCGAGTAGTGTTGGGTCGCGAAAGGCCTTGTGGTACTG 
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CCTGATAGGG (SEQ ID NO: 22) 

. . GACGATCGGCTCATCACAACCCAGCGCTTTCCGGAACACCATGAC 
GGACTATCCC (SEQ ID NO: 23) 

The sites represented by small letters are sequences recognized by Fokl and 
BstF5I. The underlined sites are sequences of fragments generated by restriction 
enzyme cleavage (7mer and 13mer). The PCR products were mixed with Fokl (NEB 
R109L) 1U, BstF5I (NEB, V0031L) 1U and lOul of reaction solution (50mM potassium 
acetate, 20mM Tris-acetate, lOmM magnesium acetate, ImM DTT). The mixture 
solution was reacted at 37°C for 2 hours, and at 45°C for 2 hours. The PCR products 
might be cleaved by Fokl at 37°C for 2 hours, and then by BstF5I at 45°C for 2 hours. 

2) Reaction 2 

The PCR was performed on the RT-PCR reaction solution using the primers 1 1 
and 12. The sequence of Template DNA (5'->3') is as follows. 

GTGGTCTGCGGAACCGGTGAGTACACCGGAAT TGCCAGGACGACCG 
GGTCC . . . (omitted) 

. . . CCCGCAAGACTG CTAGCCGAGTAGRGTTGGGTRGCGAA (SEQ ID 
NO: 24) 

The underlined sequences are sites where the following primers 11 and 12 are 
hybridized. 

Primer 1 1 (SEQ ID NO: 25). 

5' - GTGGTCTGtccaacCGGTGAGTACACCGGAAT - 3' (32mer) 
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Primer 12 (SEQ ID NO: 26). 

5' - TTCGCRACCCAACRCTACtccaacggtcCGGCTAG - 3' (35mer) 

The bases represented by R are adenine (A) or guanine (G). The mixture of 

two primers containing each base is used. 
5 The sequences of fragments generated through the PCR are as follows (5'->3'). 

GTGGTCTGtccaacCGGTGAGTACACCGGAATTG CCAGGACGACCGG GT 

CC . . . (omitted) 

CACCAGACaggttgGCCACTCATGTGGCCTTA ACGGTCCTGCTGGCCCAG 
G . . . (omitted) 

10 

. . . CCCCGC AAGACTGCTAGCCGg accgttggaGTAGRGTTGGGTRGCGAA 
(SEQ ID NO: 27) 

. . . GGGG CGTTCTGACGATCGGCctgg caacctCATCRCAACCCARCGCTT 
(SEQ ID NO: 28) 

15 

The sites represented by small letters are sequences recognized by Mmel and 
Avail, and the underlined sites are sequences of fragments generated by restriction 
enzyme cleavage (13mer, 18mer, 24mer and 19mer). The PCR products were mixed 
with Mmel (NEB R0637L) 1.5U, 50uM SAM and IX reaction solution (50mM 
20 potassium acetate, 20mM Tris-acetate, lOmM magnesium acetate, ImM DTT, pH 7.9) 
lOul. The mixture solution was reacted at 37°C for 2 hours, and then Avail (NEB, 
R0153S) 1.5U was added thereto. The resulting solution was reacted at 37°C for 2 
hours. Mmel and Avail might be simultaneously added to the mixture solution. 
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3) Reaction 3 

The PCR was performed on the RT-PCR reaction solution using the primer 13 
and 14. The sequence of Template DNA (5'-»3') is as follows. 

GACIGGGTCCTTTCTTGGA TCAACCCGCTCAATGCCTGGAG ATTTGGG 
5 CGTGCCCCCGC (SEQ ID NO: 29) 

The underlined sequences are sites where the following primers 13 and 14 are 
hybridized. 

Primer 13 (SEQ ID NO: 30). 5' - GACIGGGTCCTggatgTCTTGGA - 3' 
10 (23mer) 

Primer 14 (SEQ ID NO: 31). 5' - GCGGGGGCACggatgCCCAAAT - 3' 
(22mer) 

The bases represented by I are Inosine. 

1 5 The sequences of fragments generated by the PCR are as follows (5'->3'). 

GACIGGGTCCTggatgTC TTGGATCAACCCGCTCAATGCCTGGAGATTTG 
GGcatccGTGCCCCCGC (SEQ ID NO: 32) 

CTGICCCAGGAcctac AGGAACCTAGTTGGGCGAGTTACGGACCTCTAA 
ACCCgtaggCACGGGGGCG (SEQ ID NO: 33) 
20 The sites represented by small letters are sequences recognized by Fokl and 

BstF5I, and the underlined sites are sequences generated by restriction enzyme cleavage. 

The generated fragments are two 7mers, two 13mers, and two 14mers. The PCR 
products were mixed with Fokl (NEB R109L) 1U, BstF5I (NEB, V0031L) 1U and 10 ul 
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of reaction solution (50mM potassium acetate, 20mM Tris-acetate, lOmM magnesium 
acetate, ImM DTT). The mixture solution was reacted at 37°C for 2 hours, and at 
45°C for 2 hours. The fragments were cleaved first by Fokl at 37°C for 2 hours, and 
then by BstF5I at 45°C for 2 hours. 

5 

2. Purification & Desalination, and MALDI-TOF Mass Spectrometry 

Three types of PCR and restriction enzyme cleavage reaction solution were 
purified by the same method as that of Example 1, and then the molecular weight was 
measured. 

10 The size of fragments generated from Reactions 1 (Table 2), 2 (Table 3) and 3 

(Table 4) is shown in Tables 2-4. Genotypes of hepatitis C are determined according 
to the size of fragments by ready reckoner shown in Tables 2-4. 



[Table 2] 



Genotype 


7mer 


13mer 


la 


2216.4 


3983.6 


lb 


2216.4 


3983.6 


lc 


2216.4 


3983.6 


3a 


2216.4 


3983.6 


3b 


2216.4 


3983.6 


3c 


2216.4 


3983.6 


3d 


2216.4 


3983.6 


3e 


2216.4 


3983.6 


3f 


2216.4 


3983.6 


6b 


2216.4 


3983.6 


7a 


2216.4 


3983.6 


7b 


2216.4 


3983.6 
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7c 


2216.4 


3983.6 


2' 


2216.4 


3989.6 


5a 


2216.4 


3989.6 


lb 


2216.4 


3998.6 


Id 


2216.4 


3998.6 


le 


2216.4 


3998.6 


If 


2216.4 


3998.6 


2a 


2216.4 


3998.6 


2b 


2216.4 


3998.6 


2c 


2216.4 


3998.6 


2d 


2216.4 


3998.6 


2e 


2216.4 


3998.6 


2' 


2216.4 


3998.6 


4h 


2216.4 


3998.6 


6a 


2216.4 


3998.6 


7d 


2216.4 


3998.6 


lb 


2231.4 


3967.6 


4g 


2231.4 


3967.6 


4k 


2231.4 


3967.6 


2a 


2231.4 


3982.6 


4a 


2231.4 


3982.6 


4b 


2231.4 


3982.6 


4c 


2231.4 


3982.6 


4d 


2231.4 


3982.6 


4e 


2231.4 


3982.6 


4e' 


2231.4 


3982.6 


4f 


2231.4 


3982.6 


4f 


2231.4 


3982.6 



[Table 3] 
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Genotype 


13mer 


18mer 


Genotype 


14mer 


19mer 


la 


4049.6 


5556.6 


2a 


4337.8 


5891.8 


lb 


4049.6 


5556.6 


2e 


4337.8 


5891.8 


lc 


4049.6 


5556.6 


4b 


4337.8 


5891.8 


Id 


4049.6 


5556.6 


4e' 


4337.8 


5891.8 


le 


4049.6 


5556.6 


la 


4352.8 


5875.8 


If 


4049.6 


5556.6 


lc 


4352.8 


5875.8 


6b 


4049.6 


5556.6 


Id 


4352.8 


5875.8 


7a 


4049.6 


5556.6 


le 


4352.8 


5875.8 


7b 


4049.6 


5556.6 


2a 


4352.8 


5875.8 


4a 


4049.6 


5572.6 


2b 


4352.8 


5875.8 


6a 


4064.6 


5540.6 


2c 


4352.8 


5875.8 


7c 


4064.6 


5540.6 


2d 


4352.8 


5875.8 


7d 


4064.6 


5540.6 


2'-l 


4352.8 


5875.8 


4f 


4065.6 


5541.6 


2'-2 


4352.8 


5875.8 


4e' 


4064.6 


5556.6 


3a 


4352.8 


5875.8 


4f 


4065.6 


5557.6 


3b 


4352.8 


5875.8 


4g 


4065.6 


5557.6 


3d 


4352.8 


5875.8 


5a 


4080.6 


5525.6 


3e 


4352.8 


5875.8 


3b 


4080.6 


5541.6 


4c 


4352.8 


5875.8 


4b 


4080.6 


5541.6 


4d 


4352.8 


5875.8 


4c 


4080.6 


5541.6 


4f 


4352.8 


5875.8 


4d 


4080.6 


5541.6 


4f 


4352.8 


5875.8 


4e 


4080.6 


5541.6 


4g 


4352.8 


5875.8 


4h 


4080.6 


5541.6 


6a 


4352.8 


5875.8 


4k 


4080.6 


5541.6 


6b 


4352.8 


5875.8 


2d 


4088.6 


5515.6 


7c 


4353.8 


5876.8 


2b 


4088.6 


5530.6 


lb 


4368.8 


5860.8 


3f 


4096.6 


5526.6 


If 


4368.8 


5860.8 


2a 


4104.6 


5500.6 


3c 


4368.8 


5860.8 



27 



2c 


4104.6 


5500.6 


3f 


4368.8 


5860.8 


2e 


4104.6 


5500.6 


4a 


4368.8 


5860.8 


2' 


4104.6 


5500.6 


4e 


4368.8 


5860.8 


2' 


4104.6 


5500.6 


4h 


4368.8 


5860.8 


3a 


4111.6 


5510.6 


4k 


4368.8 


5860.8 


3c 


4111.6 


5510.6 


5a 


4368.8 


5860.8 


3d 


4111.6 


5510.6 


7a 


4368.8 


5860.8 


3e 


4111.6 


5510.6 


7b 


4368.8 


5860.8 








7d 


4368.8 


5860.8 
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[Table 4] 



HCVnt-140 Forward 


HCVnt-140 Reverse 


nt-140 Intra (14mer) 


Genotype 


7mer 


13mer 


Genotype 


7mer 


13mer 


Genotype 


Forward 


Reverse 


6a 


2191.4 


4053.6 


2'-2 


2144.4 


4110.6 


2d 


4247.8 


4392.8 


6b 


2191.4 


4053.6 


4f 


2144.4 


4110.6 


2a 


4247.8 


4408.8 


7d 


2191.4 


4053.6 


5a 


2144.4 


4110.6 


2c 


4247.8 


4408.8 


If 


2191.4 


4062.6 


la 


2144.4 


4125.6 


2e 


4247.8 


4408.8 


la 


2191.4 


4078.6 


lb 


2144.4 


4125.6 


2'-l 


4247.8 


4408.8 


lb 


2191.4 


4078.6 


lc 


2144.4 


4125.6 


2'-2 


4247.8 


4408.8 


le 


2191.4 


4078.6 


Id 


2144.4 


4125.6 


Id 


4248.8 


4368.8 


4a 


2191.4 


4078.6 


le 


2144.4 


4125.6 


If 


4287.8 


4368.8 


4b 


2191.4 


4078.6 


If 


2144.4 


4125.6 


3a 


4256.8 


4414.8 


4e' 


2191.4 


4078.6 


6a 


2144.4 


4125.6 


3c 


4256.8 


4414.8 


7a 


2191.4 


4078.6 


6b 


2144.4 


4125.6 


3e 


4256.8 


4414.8 


7b 


2191.4 


4078.6 


7a 


2144.4 


4125.6 


2b 


4562.0 


4714.0 


7c 


2191.4 


4078.6 


7b 


2144.4 


4125.6 


la 


4272.8 


4368.8 


3d 


2200.4 


4044.6 


7c 


2144.4 


4125.6 


lb 


4272.8 


4368.8 


4g 


2200.4 


4044.6 


7d 


2144.4 


4125.6 


lc 


4272.8 


4368.8 


4k 


2200.4 


4053.6 


3 a 


2159.4 


4078.6 


le 


4272.8 


4368.8 


3b 


2200.4 


4069.6 


3c 


2159.4 


4078.6 


4a 


4272.8 


4368.8 


3c 


2200.4 


4069.6 


3d 


2159.4 


4078.6 


4e' 


4272.8 


4368.8 


3e 


2200.4 


4069.6 


3e 


2159.4 


4078.6 


7a 


4272.8 


4368.8 


4e 


2206.4 


4037.6 


3b 


2159.4 


4094.6 


7b 


4272.8 


4368.8 


lb 


2206.4 


4062.6 


3f 


2159.4 


4094.6 


3d 


4570.0 


4744.0 


lc 


2206.4 


4062.6 


4c 


2159.4 


4094.6 


3f 


4546.0 


4744.0 


T-2 


2206.4 


4062.6 


4d 


2159.4 


4094.6 


3b 


4272.8 


4384.8 


4f 


2206.4 


4062.6 


4e 


2159.4 


4094.6 


4b 


4272.8 


4384.8 


5a 


2206.4 


4062.6 


4f 


2159.4 


4094.6 


4c 


4272.8 


4384.8 


lb 


2215.4 


4053.6 


4h 


2159.4 


4094.6 


4d 


4272.8 


4384.8 


2a 


2215.4 


4053.6 


4k 


2159.4 


4094.6 


4f 


4272.8 


4384.8 
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2b 


2215.4 


4053.6 


4a 


2159.4 


4109.6 


5a 


4272.8 


4384.8 


2c 


2215.4 


4053.6 


4e' 


2159.4 


4109.6 


4e 


4296.8 


4384.8 


2d 


2215.4 


4053.6 


4b 


2184.4 


4070.6 


4g 


4586.0 


4714.0 


2e 


2215.4 


4053.6 


4g 


2184.4 


4070.6 


4k 


4287.8 


4384.8 


2'-l 


2215.4 


4053.6 


2a 


2193.4 


4061.6 


4f 


4562.0 


4384.8 


3f 


2215.4 


4053.6 


2b 


2193.4 


4061.6 


4h 


4562.0 


4384.8 


4c 


2215.4 


4053.6 


2e 


2193.4 


4061.6 


6a 


4586.0 


4698.0 


4d 


2215.4 


4053.6 


2d 


2193.4 


4076.6 


6b 


4586.0 


4698.0 


4f 


2215.4 


4053.6 


2c 


2209.4 


4046.6 


7d 


4586.0 


4698.0 


4h 


2215.4 


4053.6 


2'-l 


2209.4 


4046.6 


lb 


4288.8 


4353.8 


3a 


2216.4 


4054.6 


2c 


2209.4 


4030.6 


7c 


4288.8 


4353.8 


Id 


2215.4 


4078.6 















TEFFECTS OF THE INVENTION] 
5 In an embodiment of the present invention, the analysis misled by errors in the 

conventional method for detecting mutations may be identified, and various mutations 
of adjacent base within 32 bases may be simultaneously examined. When there are 
various genotypes in an individual having mutations, whether mutations in different 
sites exist in one genotype at the same time or exist with mixed in two or more 
10 genotypes may be distinguished. In addition, mutations resulting from deletion or 
insertion can be detected. 
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